The Thinning Problem in Arabic Text Recognition – A Comprehensive Review

نویسندگان

  • Atallah M. Al-shatnawi
  • Khairuddin Omar
چکیده

The goal of this paper is to present an overview about the thinning problem in Arabic text recognition. Thinning "Skeletonization" is a very crucial stage in the ACR, it simplifies the text shape and reduces the amount of data that needs to be handled and it is usually used as a pre-processing stage for recognition and storage systems. The skeleton of Arabic text can be used for each of the baseline detection, character segmentation, and features extraction and also ultimately supporting the classification. Choosing or designing the effective thinning algorithm for Arabic text is crucial in ACR. In this paper, the importances of the thinning for the ACR and the usage of the text skeleton in ACR system are discussed and presented. As well as the challenges that have an impact on the thinning of Arabic text are discussed. The methods of Arabic text thinning are discussed and reviewed based on the technique used, and the methods advantages and drawbacks are discussed in details.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...

متن کامل

نقد کتاب پژوهشی (ادبیــات) /به فرهنگ باشد روان تندرست: نقدی بر کتاب فرهنگ واره لغات و ترکیبات عربی شاهنامه، هوشنگ محمدی افشار

The latest comprehensive and detailed research on the recognition, description, and the etymology of the Arabic lexicon of Shahnameh is the dictionary of Arabic words and Expressions of Shahnameh, written by Dr. Sajjad Aydanlou. This book is based on the second edition of the Correction of the Khaleghi Motlagh Shahnameh (1393) which is the most authoritative correction and the closest to the or...

متن کامل

A Tool to Develop Arabic Handwriting Recognition System Using Genetic Approach

Problem statement: Significant movement has been made in handwriting recognition technology over the last few years. Up until now, Arabic handwriting recognition systems have been limited to small and medium vocabulary applications, since most of them often rely on a database during the recognition process. The facility of dealing with large database, however, opens up many more applications. A...

متن کامل

High capacity steganography tool for Arabic text using 'Kashida'

Steganography is the ability to hide secret information in a cover-media such as sound, pictures and text. A new approach is proposed to hide a secret into Arabic text cover media using "Kashida", an Arabic extension character. The proposed approach is an attempt to maximize the use of "Kashida" to hide more information in Arabic text cover-media. To approach this, some algorithms have been des...

متن کامل

Region growing based segmentation algorithm for typewritten and handwritten text recognition

This paper presents a new technique of high accuracy to recognize both typewritten and handwritten English and Arabic texts without thinning. After segmenting the text into lines (horizontal segmentation) and the lines into words, it separates the word into its letters. Separating a text line (row) into words and a word into letters is performed by using the region growing technique (implicit s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015